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We propose a fully dynamic algorithm for maintaining reachability information in directed 
^v^ I graphs. The proposed deterministic dynamic algorithm has an update time of 0{(ins * n^) + 

(del * {m + n * log{n)))) where m is the current number of edges, n is the number of vertices 
in the graph, ins is the number of edge insertions and del is the number of edge deletions. 
Each query can be answered in 0(1) time after each update. The proposed algorithm combines 
existing fully dynamic reachability algorithm with well known witness counting technique to 

O'^ ■ improve efficiency of maintaining reachability information when edges are deleted. The proposed 

algorithm improves by a factor of 0( ^ "^^ . . ) for edge deletion over the best existing fully 
dynamic algorithm for maintaining reachability information. 






X 



Abstract 



> 

v/-^ ' 1 Introduction 

en I Transitive Closure problem for directed graph is well studied and motivated problem. Many appli- 

cations either can be represented using directed graphs or use directed graph as underlying data 
1^^ . structure. In reality for many of these applications, underlying directed graph changes dynamically 

C^ \ i.e. edges are inserted and/or deleted. This is a typical scenario in network routing, transportation 

and data mining. This leads to answering reachability queries for changing graph. As with any 
dynamic algorithm, we have two values in mind to understand efficiency of dynamic algorithm i.e. 
update time and query time. Update time is amount of effort required to handle changes to graph 
C^ ' while maintaining some information(reachability) related to the graph and query time is amount 

of effort required to answer (reachability) query on current graph after each update. Dynamic al- 
gorithms for maintaining reachability information on directed graphs has taken two paths in recent 
past. The trade off between these two approaches lies in update time vs. query time. First set of 
algorithms maintain reachability information explicitly in terms of transitive closure matrix where 
as the second set of algorithms don't explicitly maintain reachability information in the form of 
transitive closure matrix. First set of algorithms will have a query time of 0(1) as each query is a 
look up operation on the transitive closure matrix which is 0(1) after each update. 

Assuming reachability information is maintained explicitly in a transitive closure matrix then 
insertion or deletion of an edge operation may change 0(?7-^) entries in the transitive closure matrix. 
Therefore an amortized update time of O(n^) is essentially optimal for insert or delete operation. 



*The views expressed in this paper are the author's own and do not necessarily represent the views of Microsoft 



This optimal result is achieved by Rodittv [Rod03| . Demetrescu and Italiano [DIOO| as their algo- 
rithms maintain transitive closure of directed graph with an amortized update time of 0{n'^) by 
improving the algorithm proposed by King |Kin99j . In both these optimal results, each query can 
be answered in time 0(1) as transitive closure is explicitly maintained for the directed graph in 
consideration. Also both these algorithms support extended insert (may insert a set of edges all 
touching one vertex) operation and fully extended delete operation where completely arbitrary set 
of edges may be deleted from the graph in one delete operation. 

The second set of algorithms, where transitive closure matrix is not explicitly maintained, 
motivated by the fact that if number of queries after each update operation is relatively small, 
then we can have a dynamic algorithm with a smaller update time at the expense of a larger query 
time for maintaining reachability information. Roditty and Zwick [RZ0 2j proposed a fully dynamic 
algorithm of this type where update time of the algorithm is 0{my/n) and answers queries in 0{^/n) 
time. They achieve this by improving the results of Henzinger and King |HK95j . Demetrescu and 
Italiano [DIOO| obtained a dynamic algorithm based on fast matrix multiplication whose amortized 
update time is 0(n^'^'^^) and query time is 0{iiP'^'^^). Their algorithm [DIOOJ is a randomized 
algorithm having one sided error and only works for directed acyclic graphs(DAGs). 

Roditty and Zwick [RZ04| obtained an algorithm for maintaining reachability information with 
an amortized almost linear update time with respect to the number of edges and vertices in the 
current graph i.e. 0{m+n*log{n)) while it can answer queries in 0{n) time after each update. They 
obtained this algorithmic result by proposing an efficient way of maintaining strongly connected 
components of a directed graph which supports an interesting persistence property. Their algorithm 
is the first algorithm that breaks the 0{'n?') update barrier for all graphs with o{'n?) edges with 
query time equal to 0{n). 

The proposed algorithm has an update time of 0{{ins * v?) + {del * {m + n * log{n)))) where m 
is the current number of edges, n is the number of vertices in the graph, ins is the number of edge 
insertions and del is the number of edge deletions. Each query can be answered in 0(1) time after 
each update. Proposed algorithm combines algorithm presented in [RZ04J with well known witness 
counting technique to improve update time for edge deletion form 0{v?') to 0{m + n * log{n))) 
while answering queries in 0(1) time instead of 0{n). 

The proposed algorithm and some of the existing algorithms for dynamic reachability problem 
are compared in Table [TJ Rest of this paper is organized as follows. In the next section we 
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General 
General 
General 
DAGS 
General 
General 
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Monte Carlo 

Deterministic 
Monte Carlo 
Monte Carlo 
Monte Carlo 

Deterministic 


0{m * ^/n * log (n)) 

0{m * ^/n) 

Oim"-^^ * n) 
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0{m * ypn * log (n)) 

0(m * ^) 

©(m^'S* * n) 

0{m + n * log(n)) 


0(1) 
0{n/log{n)) 

O(n0-5^5) 
O(n0-5^5) 

0{n) 


DIOO RodOS] 
HK95] 
;RZ02] 
|RZ02 
IDIOO 

!San04] 
RZOll 


General 


Deterministic 


0{n') 


0{m + n * log{n)) 


0(1) 


In this paper 



Table 1: Fully Dynamic Reachability Algorithms 



introduce required concepts which are going to be used in the proposed algorithm and review 
some of algorithms which are going to be refined in later sections. In Section 3 we lazily update 
transitive closure using well known witness counting technique. Section 4 analyzes time complexity 



of proposed algorithm. We end this paper by concluding remarks and open problems in Section 5. 

2 Background 

Let G{V, E) be a directed graph where V is set of nodes of the graph and E is set of edges of the 
graph where (t", u) G -E iff w, u G y and there is a direct edge between nodes v, u in G. Each node 
in the graph is given unique value whose range is between 1, \V\. We most of the time use this 
value as the node itself when considering data structures. Most of other definitions and notations 
are borrowed from Roditty and Zwick [RZ04j . 

2.1 Graph Sequence 



Insert(^ ): t <- t + l,Et ^ Et-i U E 
Delete(£;'): Ej ^ Ej - E' , for 1 < j < t 



Figure 1: Insert and Delete operations on graph sequence for graph G{V,E) 

Since we are dealing with dynamic problem i.e. maintaining reachability information when 
G{V, E) under goes edge insertion and deletions, we define the following sequence on the graph 
Giy, E). Let Gi, G2, ..., Gt denote sequence of graph versions of GiV^ E) which has undergone edge 
insertions and deletions where t is the number of insertions performed so far. Gi{V,Ei) is created 
by i''^ insert operation where <i <t. Let us assume Go{V,Eq) as empty graph i.e. Eq = (j) with 
out loss of generality. The creation of a new version in the graph sequence only happens due to 
edge insertion. 

We will never create a new version of the graph for a delete operation. During delete operations, 
we will update all the versions of the graph by deleting edges if they are present in that particular 
version of the graph i.e. if we want to delete edge {v, u) and if (f , u) £ Gi where 1 < i < t, then we 
delete edge {v,u) from Gi{V,Ei). 

Proposed algorithms can support extended insert and extended delete operations where each 
extended operation is rooted at a particular node v of the graph G{V, E). We introduce the notion 
of time line for the graph sequence in order to explain algorithm and concepts in an efficient 
manner. Each update operation on the graph increases time line of the graph by one. This time 
line is denoted by Tlinec- Note that an insertion center G{V, Ei) may be inserted during time line 
k where k > i. This time line is not used in proposed algorithms but used to simplify proofs and 
explanation. 

2.2 In/Out Trees and Decremental Maintenance of In/Out Trees 

The proposed fully dynamic algorithms work on the ability of maintaining reachability trees for 
each version of the graph rooted at an extended insert operation root. 

Definition 2.1 A reachability tree is used to maintain the set of vertices that are reachable from 
a certain vertex r of a graph G = {V, E) that undergoes a sequence of edge deletionsfRZO^ . The 
nodes of the tree are not individual vertices of G but rather strongly connected components of G. 
The root of the tree is the component containing r. 



In[u\ denotes the reachability in tree rooted at node u i.e. In[u] is a tree containing all the nodes 
in G{V,E) which can reach node u. We similarly denote Out[u] as the reachability out tree rooted 
at node u i.e. Out[u] is a tree containing all the nodes of G(y,E) which are reachable from node 
we have the following theorem which is one of the ingredients of the proposed 
is true if u G In[r] and similarly ReachOut{u,r) is true if 



u. From[RZO^ 

algorithm. We say Reachln{u, r 

u £ Out[r] otherwise false. 



Theorem 2.1: The total time needed to maintain a reachability in(out)tree rooted at r is only 
0{m+n*log{n)) when the graph G{V, E) under goes a sequence of edge deletions whereas the queries 
ReachIn{u,r){ReachOut{u,r)) can be answered in 0(1) time after each edge deletion |RZ04) . 

In order to answer reachability query on the current version of the graph G{V,Et), we use the 
following approach. Query{v, u) is true iff 3rj such that v G In[ri] and u E Out[r,j\ where tj denotes 
insertion center for i^^ insertion operation and < i < t. Otherwise Query{v, u) is false. We can 
answer reachability queries this way on current version of the graph G{V,Et). Since there can be 
at most 0{n) insertion centers for any arbitrary sequence of edge insertions and edge deletions, we 
can have 0{n) as query time after each update. In the next section we show that using the concept 
of lazy update how to reduce query time after each update. 



3 Counting Technique 

For time being, let G{V, E) be any graph that is undergoing a sequence of edge deletions only. We 
say a node v is part of In[r] iff Reachln{v, r) is true in the current version of the graph. Similarly 
we say a node v is part of Out[r] iff ReachOut{v, r) is true in the current version of the graph and 
is false otherwise. 

Observation 3.1 Let r^ he an insertion center during time line k where k > i and ri becomes 
again an insertion center at time line l{> k). Let us say a node v G /n[rj] at time line k then v will 
be deleted only once from In[ri] before time line I. This happens only once when the node v is not 
reachable from node ri where r^ is i*^ insertion center and < i < t. Similarly this is applicable to 
Out arrays. 

We use above observation 13.11 and maintain transitive closure matrix TCM. Using TGM, we can 



1 


Input: v,u 


2 


if TGM[v][u] >Othen 


3 


return true 


4 


else 


5 


return false 


6 


end if 


Table 2: Query{v, u) 



answer Query (v, u) based on TCM and is shown in Algorithm [2j Query (v, u) = 1 ii TGM[v] [u] > 
otherwise 0. Note that TCM is not a binary matrix but contains number of insertion centers who 
can witness for a path directed from node v to node u when we are looking at rCM[T;][M]. Note 
that there can not be a node z, which is not an insertion center, can witness a directed path from 
a node to another node as there will not be outgoing edges from node z. If not, then z must be 



a insertion center at some point along time line. In the next few paragraphs, we show how to 
maintain witness count after each update. We refer to term "witness count" as number of insertion 
centers witnessing a directed path between two nodes in consideration. 

3.1 Handling Insertions 



1: Input: Ey 




2: if f is a previous insertion center then 




3: for each u in In[v] do 




4: for each z in OMt[w] do 




5: TCM[u] [z] = TCM[u] [z] - 1 




6: end for 




7: end for 




8: end if 




9: compute new In and Out trees rooted at node v 


using algorithm mentioned 


in [RZ()4 




10: for each u in In[v] do 




11: for each z in Out[v] do 




12: TCM [u] [z] =TCM [u] [z] + 1 




13: end for 




14: end for 




Table 3: Insert{Ey) 





We handle insertion as follows. Let us say we want to insert a set of edges, Ey, centered at node 
V. Let us assume v is not a insertion center before. First step of algorithm is to compute In[v\ and 
Out[v\ for current graph using algorithm mentioned in [RZ04J so that both In[v\ and OMt[f] can 
be maintained when edges are deleted from graph G{y,E) in 0{m + n * log{n)) time where m is 
current number of edges in the graph and n is number of vertices in the graph. Using In[v\ and 
Out\v\, we update witness count as follows. For each node u € In[v\ and z G Out[v\, we know that 
node V witness a path from node u to node z. So we update witness count for a directed path from 
node u to node z by increasing TCM[t(][z] by one. 

Now we remove assumption that v is not a insertion center before. To handle this case, just 
before computing In[v\ and Out\v\, we remove node f as a witness as we are going to compute 
from scratch when node v acts as an witness. So we decrement witness count by one for each pair 
{v,u) where v G In\v\ and u E Out[v\. This we perform before we compute new In[v\ and Out[v\. 
Algorithm shown in Algorithm [3] shows how to update TCM after inserting a set of edges, Ey, 
centered at node v. 

Lemma 3.1: Algorithm shown in Algorithm [3] correctly updates TCM where TCM[n][2;] contains 
number of insertion centers witnessing a directed path from node u to node z after inserting a set 
of edges, Ey, centered at node v. 

Proof: We have two cases. 



1. Let us assume v is not an insertion center before. We know for each u G In[v\ and z £ Out[v], 
there is a directed path from node u to node z via node v. So node v becomes a witnessing 



insertion center for a directed path from node u to node z. This is exactly what is performed 
by algorithm shown in Algorithm [3] by incrementing TCM[n][z] with one. 

2. Let us say v is an insertion center before. Let Inoiii[v] and Outoid[v] denote In and Out trees 
rooted at node v just before inserting edge set Ey. Innewiv] and Outnew[v] denote In and 
Out trees rooted at node v after inserting edge set E^. We know that we need to update 
TCM[u\[z] if only if 

(a) u G lunewM - Irioldiv] and Z € OutnewM 

(b) u £ lUoidM and z £ OutnewM - Outoid[v] 

This is true as inserting edge set E^, can only create additional paths between nodes which 
were not present in Inoid[v] and present in Innew[v] to any node in Outnew[v]- Similarly we 
can have new path between nodes which were present in Inoid[v] and to nodes which are 
presented in Outnew[v] but not in Outoid[v]- Note that with out these new paths, node v can 
not act as insertion center witnessing between above stated combination of nodes. 

This is exactly performed in algorithm shown in Algorithm [3] but in a different way by first 
decrementing TCM[u][2:] for each u G Irioidlv] and z S Outoid[v] and then incrementing 
TCM[w][z] for each u G Innew[v] and z G Outnew[v] but both operations are equal. 



3.2 Handling Deletions 



Input: Ey 

for each version of graph Gi rooted at z do 

update In[z] and Out[z] using algorithm mentioned in |RZ04j 

denote Indeietei^] ^^ nodes which are deleted from In[z] due to deletion of edge set Ey. 

denote Outdeiete[z] as nodes which are deleted from Out[z] due to deletion of edge set Ey 
end for 
Call UpdateTCM 

Table 4: Delete{Ey) 



We handle edge deletions as follows. Let us say we want to delete a set of edges, Ey, centered 
at node v. We know that node v is an insertion center. Let us assume that In[v\ is set of nodes 
which can reach node v and Ont[7;] is set of nodes which can be reachable from node v just before 
deleting edge set Ey. Let Inupdate[v\ denotes set of nodes which can reachable from node v and 
Outupdate[v\ is Set of nodes which can be reachable from node v after deletion of edge set Ey. 

Irideletelv] = In[v\ - lUupdatelv] 
OutdeleteM = Out[v] - OutupdateM 

Indeiete b] and Outdeiete [v] denotes set of nodes whose reachability got effected by deletion of edge 
set Ey with respect to node v. We use following observations to handle edge deletions. 



Observation 3.2 Let us assume u E In[w]. After deleting edge set E^, node v will not act as a 
witness for directed path from node u to node z any more iff u £ ludeiete [v] and z £ Out [v] . 

Observation 3.3 Let us assume z £ Out[v]. After deleting edge set E^, node v will not act as a 
witness for directed path from node u to node z any more iff u £ In[v\ and z £ Outdeiete[v\- 

Given ludeietelv] and Outdeieteiv]-, we can update TCM using above two observations. Algorithm 
shown in Algorithm U] shows how to update TCM after deleting a set of edges, E^, centered at node 
V. First step of algorithm is to compute Indeiete[v] and Outdeiete[v] which can be easily obtained by 
simple modification to algorithm presented in |RZ04| for maintaining In and Out trees rooted at a 
node V when graph is undergoing edge deletions. We use above two observations and update TCM 
by just one iteration over /n^^eZete [^] > In[v], Out[v] and Outdeieteiv]- This is handled in algorithm 
UpdateTCM shown in Algorithm [H 



1: for each version of graph d rooted at 


z do 


2 


for each u in ludeieteiz] do 




3 


for each / in Out[z] do 




4 


TCM[u][l] = TCM[u][l] - 1 




5 


end for 




6 


end for 




7 


end for 




8 


for each version of graph Gj rooted at 


z do 


9 


for each u in Outdeiete[z] do 




10 


for each / in In[z] do 




11 


TCM[l][u] = TCM[l][u]-l 




12 


end for 




13 


end for 




14 


end for 




Table 5: UpdateTCM 





Lemma 3.2: Algorithm shown in Algorithm Incorrectly updates TCM where TCM[u] [z] contains 
number of insertion centers witnessing a directed path from node u to node z after deleting a set 
of edges, E^, centered at node v given ludeieteiz] and Outiieiete{z] for each insertion center z. 

Proof: Let I be any insertion center and is witnessing a directed path from node u to node z 
before deletion of edge set Ey. This witness will cease if u G Indeiete\P[ and z £ Out[l] as u can not 
reach node / after deletion of edge set E.^. This is even true in case u £ In\l\ and z £ Outdeiete[l] 
as z can not be reached from node / after deletion of edge set E^. I will remain as witness when 
u £ Inupdate[l] and z £ Outupdate[l]- This is exactly performed in algorithm shown in Algorithm [H 



We combine algorithm [2l [3] and algorithm [J] to obtain a fully dynamic algorithm for maintaining 
transitive closure. We can obtain Indeiete[z] and Outdeiete[z] for each insertion center after edge 
deletion by trivial modification to algorithm presented in |RZ04| for decremental maintenance of 
In and Out trees rooted at node z. 



4 Time Complexity 

In this section, we analyze time complexity of algorithm for maintaining transitive closure presented 
in previous section when graph is under going edge insertions and deletions. 

Lemma 4.1: Time complexity of Algorithm shown in Algorithm [5] is O(n^). 

Proof: There can be at most n nodes in In[v] and at most n nodes in Out[v]. Therefore there 
can be at most v? updates to TCM where v is not an insertion center before. If v is an insertion 
center before, then at most n^ entries will be decremented in TCM and at most n^ entries will be 
incremented for new insertion center rooted at node v. Therefore proves theorem. ■ 

Lemma 4.2: For an insertion center v, number of entries changed in TCM while maintaining In 
and Out trees rooted at node v is 0{n'^). 

Proof: There can be at most n nodes in In tree and at most n nodes in Out tree rooted at node v 
when node v becomes an insertion center. After node v became an insertion center and just before 
it again becomes an insertion center along time line, nodes will disappear from In and Out tree 
rooted at node v as there can be only deletion of edges from In and Out tree rooted at node v. In 
worst case, there will not be a singe node in both In and Out tree just before node v again becomes 
an insertion center. Therefore node v will cease to witness at most n^ pairs directed path via node 
V. Therefore proves theorem. ■ 

Lemma 4.3: Let v be any insertion center, then computing Irideieteiv] and Outdeieteiv] while 
maintaining In and Out trees rooted at node v takes at most 0{m+n*log{n)) time using decremental 
maintenance of In and Out trees algorithm mentioned in |RZ04j 

Proof: From |RZ04] . we know that In and Out trees can be maintained decrementally rooted at 
an insertion center in 0{m + n*log{n)) time. We can easily obtain Indeiete[v] and Outdeiete[v] after 
each edge deletion by simply denoting nodes explicitly which are not present in both In and Out 
trees after each edge set deletion but are present before edge set deletion. ■ 

Lemma 4.4: Combining algorithm [21 [3] and algorithm [H we obtain a fully dynamic algorithm 
for maintaining transitive closure with 0(1) query time where insertion update time and deletion 
update time are 0{'n?) and 0{m + n * log{n)) respectively. 

Proof: From lemma 4.1, we know that time taken to maintain TCM when we insert a set of edges 
rooted at node v is 0{'n?) and also decremental maintenance of In[v\ and OMt[7;] trees rooted at 
insertion center v is 0{m + n*log{n)) which includes cost for providing Indeiete\v\ and Outdeiete\v\ 
after each update. We know from lemma 4.2, there can be at most 0{'n?') updates to TCM for an 
insertion center v where In[v\ and Ont[u] are undergoing edge deletions only. This cost is amortized 
onto insertion. Therefore proves theorem. ■ 



5 Future work and Conclusions 

In this paper we have proposed an algorithm for maintaining reachability information when graph 
is undergoing edge insertions and deletions. The algorithm has an insertion update time of 0{'n?') 



and a deletion update time of 0{m + n* log{n)). Queries can be answered in 0(1) time after each 
update. This is the first deterministic algorithm to have linear update time for edge deletion while 

2 

answering queries in 0(1) time. The proposed algorithm improves by a factor of 0( ^ "^^ /C ) over 
existing best fully dynamic algorithm for maintaining reachability information. Reducing insertion 
time while maintaining same deletion update time and query time is a major open problem. 

References 

[DIOO] Camil Demetrescu and Giuseppe F. Italiano. Fully dynamic transitive closure: Breaking 
through the o{'n?) barrier. In IEEE Symposium on Foundations of Computer Science, 
pages 381-389, 2000. 

[HK95] Monika Rauch Henzinger and Valerie King. Fully dynamic biconnectivity and transitive 
closure. In FOCS, pages 664-672, 1995. 

[Kin99] Valerie King. Fully dynamic algorithms for maintaining all-pairs shortest paths and tran- 
sitive closure in digraphs. In FOCS '99: Proceedings of the 40th Annual Symposium on 
Foundations of Computer Science, page 81, Washington, DC, USA, 1999. IEEE Computer 
Society. 

[Rod03] Liam Roditty. A faster and simpler fully dynamic transitive closure. In SODA '03: 
Proceedings of the fourteenth annual ACM- SI AM symposium on Discrete algorithms, pages 
404-412, Philadelphia, PA, USA, 2003. Society for Industrial and AppUed Mathematics. 

[RZ02] Liam Roditty and Uri Zwick. Improved dynamic reachability algorithms for directed 
graphs. In FOCS '02: Proceedings of the 43rd Symposium on Foundations of Computer 
Science, page 679, Washington, DC, USA, 2002. IEEE Computer Society. 

[RZ04] Liam Roditty and Uri Zwick. A fully dynamic reachability algorithm for directed graphs 
with an almost linear update time. In STOC '04: Proceedings of the thirty-sixth annual 
ACM symposium on Theory of computing, pages 184-191, New York, NY, USA, 2004. 
ACM Press. 

[San04] Piotr Sankowski. Dynamic transitive closure via dynamic matrix inverse (extended ab- 
stract). In FOCS '04: Proceedings of the 45th Annual IEEE Symposium on Foundations of 
Computer Science (FOCS'04), pages 509-517, Washington, DC, USA, 2004. IEEE Com- 
puter Society. 



